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RELATED CASES 

Reference is made to commonly-assigned copending U.S. Patent 

Application Serial No. (Attorney. Docket No. D/A2456Q), entitled A 

SYSTEM AND METHOD FOR DIGITAL WATERMARKING IN A CALIBRATED 
PRINTING PATH, by Ricardo L. de Queiroz et al, the disclosure(s) of which are 
5 incorporated herein. 

FIELD OF THE INVENTION 
The present invention generally relates to watermarking methods and, 
more particularly, to those methods for digital watermarking which embed hidden 
data in a manner designed for the printing process having high information 
channel capacity while not degrading image texture or resolution 

BACKGROUND OF THE INVENTION 
10 For many reasons, it is desirable to enable data hiding within a digital 

image. Those reasons include image security, authentication, covert 
communication, rendering instructions, and providing additional useful 
information. Some existing digital watermarking methods do not survive the 
printing process as most watermarking methods are designed for use in a non- 
15 printed contone image. Existing methods designed for contone images are too 
fragile to be encoded into a print and be retrievable. Glyphs and other low 
frequency methods that can be used in a print setting often introduce 
undesirable textures or lower the spatial resolution of the image. 

Many printing processes produce color by combining 4 or more colorants. 
20 The most common case is the use of cyan, magenta, yellow, and black 
colorants, referred to herein as C,M,Y,K. It is well known that these 4 colorants 
can be combined in many different ways to produce the same color, as 
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perceived by the human visual system. For a given image or application, the 
specific method of combining C, M, Y, K is determined by the gray-component 
replacement (GCR) function. This function is, in general, a multidimensional 
mapping that relates the basic subtractive primaries, C, M, Y to the actual C, M, 

5 Y, K colorants that constitute the final printed image. A simple and common 
form of the GCR function is defined by a pair of functions: one that relates 
min(C, M, Y) to the amount of K to be added, and another that relates min(C, M, 
Y) to the amount of C, M, Y to be removed or replaced by the K. The two 
functions are dependent upon each other in a way that yields the desired color. 

10 Note that the quantity min(C, M, Y) is the amount in which all three colorants are 
present, and thus is qualitatively thought of as the "gray component" of the color. 
Functions of this quantity that dictate the amount of K to be added and the 
amount of CMY to be replaced are thus collectively termed "gray component 
replacement". 

15 In principle, a given image can be rendered with different GCR functions 

to yield visually similar or identical color renditions. Of these a single GCR 
function is generally chosen to render a given image or object within a page. 
The choice is based on a number of factors, including achievable print gamut, 
colorant area coverage limitations, smoothness and texture considerations, 

20 sensitivity to registration, colorant cost, etc. 

What is needed in the art is a watermarking method designed for use in 
the printing process which does not degrade textures and which has a high 
information channel capacity. 

BRIEF SUMMARY 
What is disclosed is a method for digital watermarking in a calibrated 
25 printing path and comprises: first receiving a pixel possessing color values from 
an input image; receiving a plurality of information bits to be encoded at a 
corresponding pixel in an output image. Then, one of at least two different GCR 
functions are selected where the selection is based on the state of the received 
information bits. The number of GCR functions to be selected from is dependent 
30 on the number of possible states of the information bits intended to be encoded 
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at each image pixel and preferably equals the number of states of the 
information bits intended to be encoded at each image pixel such that the GCR 
spatially varies across the output image. Further, at least two GCR functions are 
optimized to carry information and information bits intended to be encoded within 
5 the output image are represented with a tag. CMYK values are then generated 
using the selected GCR function and the color values. These CMYK values are 
assigned to a corresponding pixel in the output image. The information bits to be 
encoded at a given pixel indicate the type of object to which that pixel belongs, 
such as: graphics, picture, text, line art, etc.. The output image, when printed, 

10 exhibits the property that substantially similar colors occurring at different spatial 
locations in the input image are produced with substantially different CMYK 
combinations in the print. Preferably, a parameterized function is used for the 
GCR function and the encoded state sets the parameter of the function. 
Information bits should be redundantly encoded throughout the output image. 

15 Regions that cannot be encoded by GCR information have to be compensated 
for. A reference mark is applied on the output image to indicate the starting point 
and order of the information sequence. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The preferred embodiments and other aspects of the invention will 
become apparent from the following detailed description of the invention when 

20 read in conjunction with the accompanying drawings which are provided for the 
purpose of describing embodiments of the invention and not for limiting same, in 
which: 

Figure 1 illustrates the method of the present invention of inserting a 
watermark; and 

25 Figure 2 illustrates the method of the present invention of extracting the 

watermark. 

DESCRIPTION OF THE SPECIFICATION 
A method is disclosed for encoding or digital watermarking (data hiding) 
for use in the printing process. Attention is respectfully directed to Fig. 1 . An 
input pixel 10 is received in an RGB color space or an other color space, which 
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has been converted to CMY values, at 12. Alternatively, the input pixel is 

received as a CMY pixel. 

Two or more OCR functions are loaded into GCR processor 14 (shown 

referenced as GCR-1, GCR-2, GCR-/V). For A/-GCR functions, information 
5 16 to be transmitted is converted therein into /V-ary form. The N-ary form is one 

wherein the information at each pixel can be represented by one of AMevels. For 

example, if two GCR functions are to be used (/V=2), the A/-ary form for the 

information is binary, so that each pixel is represented by one of two levels [0, 1]. 

One might choose GCR1 = 100%GCR (maximum amount of gray component is 
10 replaced with K) to encode every 0, and GCR2 e 0%GCR (no K is used to 

replace the gray component) to encode every 1 . Expressed analytically, GCR1 

and GCR2 of this simple example perform the transformation: CMYK! = (C- 

min(C,M,Y), M-min(C,M,Y), Y-min(C,M,Y) , min(C,M,Y)); for GCR-1 s GCR 100 %; 

CMYK 2 = (C, M, Y, 0); and for GCR-2 = GCR 0 %; where C, M and Y on the right 
15 side of the equations indicate values into the GCR processor 14, and C, M, Y, 

and K on the left side of the equations indicate values output from GCR 

processor 14. 

From the same CMY data input to GCR processor 14, more specifically, 
from the same CMY data at a particular image location, two or more alternative 
20 CMYK sets can be generated and output from GCR processor 14. The particular 
CMYK set utilized at the given location indicates the information state encoded at 
that location. For instance, using the example of 0% and 100%GCR from above, 
CMYK = (C,M,Y,0) at a particular location would have a 1 encoded at that 
location. 

25 The information bits 16 are arrayed in a sequence that produces a spatial 

orientation and layout when the CMYK values are printed. For instance, an 8x8 
pixel block of CMY data may receive the encoding of the first /V-ary number 
formed from the information bits. The second 8x8 block may receive the second 
N-ary number, and so on. Different block configurations may be used, or 

30 encoding may follow other space filling paths, such as along a space filling 
curve. Also, the encoding could be adaptive to image content, so that particular 



4 



image features, such as letters, graphic objects, pictures, etc., provide the spatial 
support for the encoding process. Since the data is arrayed spatially in the 
image, in some cases it is useful to have a reference point on the page from 
which the sequence of information can be read. The reference point could be a 
corner, a specially designed cross or dot pattern, or some agreed upon image 
feature. 

There are several ways that GCR processor 14 can convert input CMY 
values to output CMYK values. One example of operation processes the 
incoming data at a location only with the GCR function selected by the 
information bits (N-ary number) to be encoded at that location. Another method 
processes the incoming CMY data with all GCR functions, and the information 
bits associated with that location in space select the CMYK data to be used in 
forming the image. The CMYK sets formed thereby are printed to form output 
image 1 8. The information bits are now embedded onto the output print. 

If the printing path has been properly calibrated, printed CMYK sets, 
associated with GCR functions, will look essentially the same under the 
illuminant for which the calibration was performed. The GCR functions should 
be optimized so that they are easily differentiated from each other in the 
subsequent decoding process, while at the same time, producing colors that are 
visually substantially similar under the calibration illuminant. 

It should be appreciated by one skilled in this art that some regions in the 
output image will be less useful than others for embedding information using the 
present method. This includes regions where little or no GCR is necessary (e.g., 
C=0 or M=0 or Y=0) and very dark regions because it is more difficult to detect 
black pixels from high-resolution scans due to excessive dot overlap. These 
regions may be considered to have low or zero channel capacity for information. 
Various methods can be used to avoid information loss that could occur by 
attempting to encode information in these regions. For instance, the information 
could be encoded redundantly, throughout the image, thereby increasing the 
likelihood of transmitting the information in the print. Otherwise, low information 
capacity regions can be recognized by their input CMY values, and those regions 
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can be skipped in the encoding processing, and encoding would resume when 
the next region was input to GCR processor that had sufficient information 
capacity. 

Although current color management software found in the arts allows 
5 different GCR functions to be applied to different objects, the purpose therein is 
to enable the optimization of image quality whereas, the present invention uses it 
convey information. The present encoding method is described as spatially 
varying, as in varying across a pictorial image. It should be understood that the 
GCR may also vary with objects on a page. For instance different graphical 

10 features, letters, vignettes, and the like could each have a different GCR that is 
optimized to carry information. The watermark information can be a tag plane, or 
auxiliary image plane sometimes referred to as an omega channel or alpha 
channel, that is used to select the GCR functions in the encoding. 

With reference now being made to Fig. 2, to decode the embedded 

15 information, the printed document 20 is scanned into electronic form. The data 
contained therein are converted to RGB color space by this scanning process. 
The K colorant amounts are then estimated from the RGB data. By design, two 
or more colors match visually even though they were produced with different 
GCR strategies and thus have different K amounts for the same colored area. 

20 Thus, to retrieve K values from RGB data, information in addition to colorimetric 
values is used. In the present invention, the non-overlap properties of rotated 
halftone screens provide this additional information. 

The K-colorant signal, denoted K H , is estimated from a high-resolution 
scan of the RGB image. In one embodiment, this K-colorant signal is evaluated 

25 conditional to a capacity signal K L which relates the capacity of pixels to possess 
K-toner to luminance L. The importance of deriving K L is to normalize the value 
of K H against some reference point. Different GCR strategies generate images 
that may look different except under some illuminant for which the color 
correction was derived, i.e., illuminant metamerism might occur between the 

30 viewing illuminant and the scanner illuminant. Thus the GCR (and hence, the 
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embedded bit data) is estimated by analyzing the vector { K L , K H , L}, i.e., the 
value of K H given K L and L. 

In an alternate preferred embodiment, the K-colorant signal, K H , is 
evaluated conditional to a low-resolution colorimetric signal from the image, 
5 RGB. In other words, RGB is a low resolution measure of the color of a local 
area and provides a measure of the capacity of that area to possess K-colorant. 
The GCR (hence, the embedded bit data) is estimated by analyzing the vector 
{R, G, B, K H }, i.e., the value of K H given R, G, B. 

The scanned RGB values may be inverted to obtain CMY estimates: C=1- 
10 R, M=1-G, Y=1-B. One can then estimate K H = min(C,M,Y) = 1-max(R,G,B). If 
S is a suitable operator to reduce the image from the scanner resolution to the 
watermark resolution (e.g., a sequence of blurring filters followed by sub- 
sampling), then Kh = S(min(C,M,Y)) = S(1-max(R,G,B)). Even though K H 
resembles that for generating K in a 100% GCR strategy, its present use is 
15 limited to high-resolution scanned data where the printer dots are discernible. 

In an alternate embodiment, scanner RGB values can be converted to an 
estimate of CMY by a calibration of the scanner system. For example, a 3x3 
linear transformation matrix can be used to best relate scanner RGB values to 
actual known amount of cyan, magenta, and yellow used in the print. It will be 
20 appreciated by those skilled in the art that other transformations can be used to 
relate scanned RGB to printed CMY. This transformation can be used to 
improve the estimation of K from scan RGB in the actual embedded prints. 

In the alternative, spatial properties of the high-frequency image are 
exploited to determine K H . Assuming halftoning with rotated screens, CMY dots 
25 do not frequently overlap completely in the highlight and mid-tone regions. 
Typically, the amount of overlap of CMY dots covers a small area percentage at 
highlights and mid-tones. If the input image has perfect dots and a scanner that 
has high enough resolution that it is able to perfectly scan the dots and their 
overlaps is used, then the scanned RGB values thereof only assume a small 
30 number of combinations that depend on the geometry of the dots against the 
scanner resolution. If each of the RGB values for any ^iven pixel in the scan is 

> 
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low enough then that pixel essentially comprises all black color. A black-colored 
pixel can then be assumed a K-dot. This K-dot assumption is a good 
approximation because for low RGB there must be either a K-dot ' or an 
overlapping of CMY-dots. Because the overlap area of CMY-dots is often small 
5 in highlights and mid-tones (due to rotated screens) it is therefore probable that 
the black-colored pixel is a K-dot. Additional information, for example, ensuring 
that the pixel has very low chroma, is also useful. Preferably, a combination of 
luminance and chroma can be used to distinguish between a K dot and an 
overlap of CMY dots. 

10 Additionally, at the dot center, a small K dot on a light background will 

have a lightness that is higher than a larger K dot on a darker background. Thus 
a transformation is necessary to normalize the lightness of the dot based on the 
scanned RGB. This is done as follows. One out of /V-GCRs can be estimated 
by first, in the training session, determining one image or region that was 

15 processed with each GCR. For each region or image, compute DOi, K L , 
LDDDE(K H | K L , L, GCR=i), i.e., the average value of K H for each (Kl, L) pair. In 
the on-line detection phase, find the DDthat is the closest to the received K H . In 
the binary case, there are only two values to compute, DDI, K L , LDOOand 

□ □□□ Kl, LDDD so that it is easier to set up threshold: DDKl, L) = [DDI, K L , 
20 LDD+ □□□□ K L , such that K H is simply compared to a threshold 

□ □Kl, L). The array of the thresholds (typically 256x256) is preferably setup 
beforehand to simplify the computation, i.e., detection can be made with one 
look-up and one comparison. Thus the determination of which pixel is 
associated with a K dot depends on the average lightness of the region. 

25 Because Kl relates to RGB in an average colorimetric way and does not 
comprehend microscopic halftone dot geometry, it can be reduced (using S as 
defined) to: K L = 1 - max(S(R ), S(G), S(B)). The luminance value from the low- 
resolution scans is: L = a S(R ) + b S(G) + c S(B). 

In the alternative, for decoding the GCR-based watermark, the function 

30 DDi, R, G ,BD □□which relates the expected amount of K to the scanned RGB 
signal, is obtained through a separate calibration process, as follows. A set of 
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CMY values is generated, preferably as a 3-dimensional grid in CMY space. 
These values are processed through each of the A/-GCR functions to obtain N 
sets of CMYK values. If other functions, such as ink-limit, and tone reproduction 
curves (TRCs), are applied during nominal printing of images, these functions 
5 are to also be applied to the CMYK data. Subsequently, from each CMYK data 
set, a target of patches is generated, printed and scanned. The estimate D0\O 
R,G,BDDDi=1 J ...,N, of the amount of black colorant in each patch is obtained for 
each of the N scanned targets corresponding to the A/-GCR functions. One 
exemplary method for doing this has been described previously. The black 

10 estimates are then used to populate a 3-dimensional lookup table (LUT) whose 
inputs are the scanned RGB values, and whose outputs are the N values of □ 
corresponding to the A/-GCR functions. 

Given a scan of an arbitrary printed image, the black estimate K H is 
derived using the same algorithm employed in the aforementioned calibration 

15 procedure. The RGB values of the scanned pixels are used to index the 3D 
LUT, and K H is compared with each of the N number of □ values. The closest □ 
value determines which of the A/-symbols was encoded at the given image 
location. 

In the case where A/=2, one can alternatively fill the 3-D LUT with the 
20 threshold: □□ R, G, B) = [□□!, R, G, BDDD+ □□□□ R, G, BD □ □ DThe 

received black estimate K H is then simply compared with DDR, G, B) to 
determine which of the two symbols was encoded at the given image location. 

It is to be appreciated that methods for filling 3-D LUTs given an arbitrary 
sampling of input and output data is well known in the art. See for example: R. 
25 Bala, DEVICE CHARACTERIZATION , Digital Color Imaging Handbook, Chapter. 
5, by: G. Sharma, Ed., CRC Press, 2003. It is also noted that pre-conditioning 
transforms may be applied prior to the 3-D LUT that increases the efficiency and 
accuracy of the 3-D LUT. For example, a 3x3 pre-conditioning matrix that 
approximately maps scanned RGB to printer CMY can be followed by a 3-D LUT 
30 that maps the said printer CMY to the D-values. Since the original calibration 
targets are designed as grids in CMY space, such a preconditioning transform 
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should result in a better utilization of nodes in the 3-D LUT. Also it attempts to 
explicitly recover the original GCR, which is a mapping from CMY to K, rather 
than a mapping from scanner RGB to K. 

In an alternative look-up-table method for decoding, the system is similarly 
5 trained using a test target, and for every pixel, a quintuple: [R, G, B, K, Q] is 
organized where Q is a discrete number telling which GCR was used to print that 
pixel. The RGBK hypercube is mapped into one number Q. The training 
method works by simple majority: The RGBK hypercube is divided into N number 
of cells. Then for every pixel in the printed-and-scanned image, KH and RGB 
10 are determined. The appropriate Q value is associated with the cell. This is 
repeated for each pixel in every target. After entering the values for every cell, a 
histogram of Q values is determined. The most popular Q value is entered at 
that cell location. Then a LUT is constructed with the RGBK values as input and 
Q as output 

15 Thus each cell maps the RGBK value to its most likely GCR strategy. To 

ensure that no cells are empty, we start by running the above training algorithm 
with partition of RGBK into 2x2x2x2 cells. If any of the cells is empty, a random 
Q value is assigned to it. The process is repeated for the 4x4x4x4 cell partition. 
At the end of the process, any empty cells inherit the Q value from its 

20 corresponding position in the 2x2x2x2-partition stage. For non-empty cells, the 
4x4x4x4 value is used. The process is repeated for 8x8x8x8. Any empty cells 
inherit the corresponding Q value from the 4x4x4x4 partition stage. The process 
is repeated again to obtain 16x16x16x16 cells, inheriting Q values from the 

8x8x8x8 partition stage. The 16x16x16x16 partition is probably accurate enough 

ii 

25 to map RGBK values to a small number of GCR functions. The run-time 
detection algorithm works as follows. In the high resolution image, estimate K H . 
Reduce K H image to low resolution. Then, reduce RGB scanner data to low 
resolution. For every pixel in the low resolution image, compute the RGBK H 
quadruple. Feed that RGBK H into the cell LUT, retrieve Q which is the GCR 

30 estimation. 
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While particular embodiments have been described, alternatives, 
modifications, variations, improvements, and substantial equivalents that are or 
may be presently unforeseen may arise to applicants or others skilled in the art. 
Accordingly, the appended claims as filed and as they may be amended are 
5 intended to embrace all such alternatives, modifications variations, 
improvements, and substantial equivalents. 
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